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LIMIT DEXTRIN ASE INHIBITOR PROMOTER 



1. FIELD OF INVENTION 

The invention relates generally to a DNA sequence identified from Hordeum vulgare 
which acts as a tissue specific promoter. The invention further relates to the use of said DNA 
sequence to regulate exogenous gene expression in the endosperm and aleurone tissues of the 
seed of a plant. 



2. BACKGROUND 

The expression of a gene is dependent upon its DNA sequence being transcribed into 
RNA by the action of RNA polymerase. To achieve this, the RNA polymerase must recognise 
and attach to a region of DNA sequence located upstream of (i.e. 5 ■ to) the gene coding sequence 
m order for transcription to be initiated. Such a region is termed the promoter of the gene The 
intnnsrc nature of the promoter sequence determines the circumstances and the manner in which 
the gene is expressed. 

There are, broadly speaking, four types of promoters found in plant tissues; constitutive 
tissue-specific, developmentally-regulated, and inducible/repressible. 

A constitutive promoter directs the expression of a gene throughout the various parts of a 
plant continuously during plant development, although the gene may not be expressed at the 
same level in all cell types. Examples of known constitutive promoters include those associated 
with the cauliflower mosaic virus 35S transcript (Odell et al, 1985, Nature 313 810-812) the rice 
actm 1 gene (Zhang et al, 1991, Plant Cell 3 1 155-1 165) and the maize ubiquitin 1 gene (Cornejo 
et al, 1993, Plant Molec. Biol. 29 637-646). 

A tissue-specific promoter is one which directs the expression of a gene in one (or a few) 
parts of a plant, usually throughout the lifetime of those plant parts. The category o f tissue- 
specific promoter commonly also includes tissue-preferred or tissue-enhanced promoters which 
may also direct expression at some level in tissues other than the preferred tissue. Examples of 
tissue-specific promoters known in the art include those associated with the patatin gene 
expressed in potato tuber and the high molecular weight glutenin gene exposed in wheat, barley 
or maize endosperm, 

A developmentally-regulated promoter directs a change in the expression of a gene in one 
or more parts of a plant at a specific time during plant development. The gene may be expressed 
m that plant part at other times during plant development at a different (usually lower) level, and 
may also be expressed in other plant parts. 
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An inducible promoter is capable of directing the expression of a gene in response to an 
inducer. In the absence of the inducer the gene will not be expressed. The inducer may act 
directly upon the promoter sequence, or may act by counteracting the effect of a repressor 
molecule. The inducer may be a chemical agent such as a metabolite, a protein, a growth 
regulator, or a toxic element, a physiological stress such as heat, wounding, or osmotic pressure, 
or an indirect consequence of the action of a pathogen or pest. A developmentally-regulated 
promoter might be described as a specific type of inducible promoter responding to an 
endogenous inducer produced by the plant or to an environmental stimulus at a particular point 
in the life cycle of the plant. Examples of known inducible promoters include those associated 
with wound response, such as described by Warner et al (1993), temperature response as 
disclosed by Benfey & Chua (1989), and chemically induced, as described by Gate (1995). 

A promoter sequence may comprise of a number of defined domains necessary for its 
function. The first of these defined domains comprises approximately 70 base pairs located 
immediately upstream of (5' to) the structural gene and forms the core promoter. The core 
promoter contains the CAAT and TATA boxes and defines the transcription initiation site for the 
structural gene. A series of regulatory sequences upstream of the core promoter constitute the 
remainder of the promoter sequence and determine the expression levels, the spatial and 
temporal patterns of expression, and the response to inducers. In addition some promoters 
contain sequence elements which act to enhance the level of expression, for example that from 
the pea plastocyanin promoter as described in International Patent Application, Publication No. 
WO 97/20056. 

Genetic modification of plants depends upon the introduction of chimaeric genes into 
plant cells and their controlled expression under the direction of a promoter. Promoters may be 
obtained from different sources including plants, fungi, bacteria and viruses, and different 
promoters may work with different efficiencies in different tissues. 

It may often be desirable to express introduced genes in a number of different tissues 
within a plant. For example, the expression of a resistance to a pathogen or pest, or tolerance to 
temperature extremes might be best expressed throughout all tissues in the plant. Similarly it 
might be desirable to ensure the expression of the transgenes at all times throughout the 
development of the plant. Also, a promoter which is expressed in a manner that is immune to the 
influence of inducers or repressors resulting from unforeseen environmental stimuli may also be 
useful to ensure the continued expression of a trait. For these purposes, the use of a 
"constitutive" promoter would be desirable. Examples of constitutive promoters include the 



CaMv 35S promoter. For cereals the ubiquitm' promoter is a constitutive promoter of choice 
(Christensen & Quail, 1996) 

However, in some instances it is more desirable to control the location of gene expression 
in a transgenic plant to enhance the effect of gene expression by ensuring that expression occurs 
preferentially in those tissues where the effect of the gene product is most efficacious. By the 
same argument, modulated expression can reduce potential yield loss by reducing the resource 
dram on the plant. Further advantages include limiting the expression of agronomically useful 
yet generally deleterious genes to specific tissues by localisation and compartmentaUsation of 
gene expression in cases where the gene product must be restricted to, or excluded from, certain 
tissues. For example, anther specific expression of the sue inhibitor genes (Mariani et al, 1990 
Nature 347 737) has been used in male sterility systems, whereas expression in other parts of the 
plant would result in toxicity. 

Examples of endosperm specific barley promoters include the high molecular weight 
glutenin promoter (Halford et al ). However, there is a lack of suitable endosperm specific 
promoters. 



3. SUMMARY OF THE INVENTION 

The present invention is directed to a DNA sequence identified from Hordeum vulgare 
(barley). 

The present invention provides a DNA sequence comprising a promoter sequence, the 
DNA sequence being selected from the group consisting of: (a) a DNA sequence, the sequence 
being known herein as SEQ. ID. No.l; (b) a DNA sequence comprising a portion of the 
sequence of the isolated DNA of (a), and being capable of regulating the expression of a gene- 
(c) a DNA sequence, which sequence is at least 95% , 90%, 85% 80%, 75% or 70% homologous 
to the DNA sequence of (a) or (b) above; (d) a DNA sequence, which sequence is at least 80% 
identical to the DNA sequence of (a) or (b) and which is capable of regulating the expression of 
a gene; (e) a DNA sequence comprising a portion of the sequences of the DNAs of any one of (a) 
- (d) above and being capable of regulating the expression of a gene; (f) a DNA sequence, which 
sequence ls complementary to the DNA sequence of any one of (a) to (e) above. 

Preferably the DNA sequence is capable of regulating expression of a gene which 
encodes a limit dextrinase inhibitor protein (LDI). 
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More preferably the DNA sequence regulates expression of the gene encoding the Limit 
dextrinase inhibitor-like protein in Hordeum vulgare. Most preferably the DNA sequence 
regulates expression of the gene encoding the Limit dextrinase inhibitor protein in the endosperm 
or aleurone tissues of developing seeds of Hordeum vulgare. 

The DNA sequences referred to herein may be isolated DNA sequences or, alternatively, 
may be synthesised DNA sequences. 

The present invention also provides a recombinant DNA comprising vector DNA and a 
DNA sequence as described in any one of (a) to (f) above. The recombinant DNA may suitably 
further comprise the DNA coding sequence of a gene. 

Preferably the recombinant DNA resides in a host cell suitable for transcription and 
translation. 

The present invention also provides a method of regulating the expression of a gene, the 
method utilising the DNA sequence as described in any one of (a) to (f) above operably 
associated with the coding sequence of a gene. 

The present invention also provides a transgenic plant, the cells of which plant comprise 
the DNA as described in any one of (a) to (f) above operably associated with a gene coding 
sequence. 

The present invention further provides a method of modifying the metabolism within the 
cells of a transgenic plant utilising the DNA sequence as described in any one of (a) to (f) above 
operably associated with the coding sequence of a gene. 

The present invention even further provides a method of producing a gene product 
within the cells of a transgenic plant utilising the DNA sequence as described in any one of (a) to 
(f) above operably associated with the coding sequence of a gene. 

The present invention also provides an oligonucleotide probe which selectively 
hybridizes to a DNA sequence as described in any one of (a) to (f) above. 

3.1 SEQUENCE IDENTIFIERS 

In the sequence listing: 

SEQ ID No. 1 shows the DNA sequence of the isolated barley limit dextrinase inhibitor 
promoter . 

SEQ ID No. 2 shows the nucleotide and derived amino acid sequence of the isolated barley limit 
dextrinase inhibitor protein. 
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SEQ ID No. 3 shows the derived amino acid sequence of the isolated barley limit dextrinase 
inhibitor protein. 

SEQ ID No. 4 shows the nucleotide sequence of the PCR primer inhib5. 
SEQ ID No. 5 shows the nucleotide sequence of the PCR primer inhib6. 
SEQ ID No. 6 shows the nucleotide sequence of the PCR primer API. 
SEQ ID No. 7 shows the nucleotide sequence of the PCR primer GSP1 . 
SEQ ID No. 8 shows the nucleotide sequence of the PCR primer AP2 
SEQ ID No. 9 shows the nucleotide sequence of the PCR primer GW1. 

4. BRIEF DESCRIPTION OF THE FIGURES 
In order that the invention may be readily carried into effect reference will now be made, 
by way of example, to the following diagrammatic drawings in which; 

Figure 1 shows the results of an RT-PCR analysis of barley limit dextrinase inhibitor expression. 
The RT-PCR analysis of LDI expression was performed using Inhib 5 and Inbib 6 primers. 
Legend: Mffind m molecular weight markers (X); complete barley grains 20 dpa (1); complete 
barley grains 40 dpa (2); endosperm of barley grains 14 dpa (3); aleurone layers of barley grains 
treated with gibberellic acid (4); negative control of RT-PCR (5); days post anthesis (dpa); 
Figure 2 shows a the results of Northern blot analysis of limit dextrinase expression. The top 
panel of the blot shows the RNA gel and the bottom panel of the blot shows the Northern blot. 
Northern blot analysis of LDI expression was performed using LDI RNA probe on different 
RNA samples. Legend: Leaves in the light (LL); leaves in the dark (LD); roots (r); post anthesis 
(pa.); 

Figure 3 shows the results of Southern blot analysis of limit dextrinase inhibitor in barley 
genomic DNA. The Southern analysis of barley ("GoldenPromise") DNA was performed using 
limit dextrinase inhibitor DNA as a probe. Legend: A/Hind HI molecular weight markers (X); 
DNA digested with EcoR V <E); DNA digested with BamHl (B); DNA digested with Xho I (X);' 
Figure 4 shows an isolation of limit dextrinase inhibitor promoter fragments by genome walking 
PCR. DNA was digested with Dral, Smal, EcoR V, Pvu H and Sea I. The primers used were 
AP2 and GW 1, and the PCR products analyzed on a preparative 1% TAE agarose gel. Legend: 
X/Hind mmolecular weight markers (X); negative control of PCR reaction (-ve). Base pairs 
(bp); 

Figure 5 shows a Barley transformation construct containing the limit dextrinase inhibitor 
promoter pCAMBIA1302 with LDI promoter; 
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Figure 6 shows a transient expression assay using the limit dextrinase promoter: GFP construct 
Fluorescent microscopy was performed on endosperm 2-4 weeks post anthesis transiently 
assayed with an LDI Promoter: GFP construct Legend: Non-bombarded controls (A C E and 
G); bombarded with LDI PromotenGFP construct and analyzed for GFP expression 24h and 48h 
after bombardment (B, D, F and H). Scaling bar is lOOum in length. 

Figure 7 shows the putative transcription start sites and TATA sequences (-25 bp) within SEQ. 
ID. No: 1 as predicted by neural network (M.G. Reese, NX. Harris & F.H. Eeckman. 1996- 
Large Scale Sequencing Specific Neural Network for promoter & slice site recognition' 
Biocomputing Proceedings of 1996 Pacific Symposium). The putative transcript start site is 
shown in larger font. 

Figure 8 shows the results of a full sequence search using the BLASTN algorithm for sequences 
similar to SEQ. ID. No. 1 of the present invention. The search results provided no significant 
homology with any known sequence in the four databases searched. 

5. DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to plant gene regulatory sequences. Specifically this 
invention relates to a promoter identified in Hordeum vulgare (barley). 

A plant DNA sequence may be recovered from the cells of the natural host or it may be 
synthesized directly in vitro. Extraction from the natural host enables the isolation de novo of 
novel sequences, whereas in vitro DNA synthesis generally requires pre-existing sequence 
information. Direct chemical in vitro synthesis can be achieved by sequential manual synthesis 
or by automated procedures. DNA sequences may also be constructed by standard techniques of 
annealing and ligating fragments or by other methods known in the art. Examples of such 
cloning procedures are given in Sambrook et al. 

The DNA sequence of the present invention may be isolated by direct cloning of 
segments of plant genomic DNA and screening the DNA sequence for the presence of diagnostic 
sequence motifs characteristic of known promoter sequences. Suitable segments of genomic 
DNA may be obtained by fragmentation using restriction endonucleases, sonication, physical 
shearing, or other methods known in the art. 

The i dentification of the cloned segment as a promoter s equence may alternatively b e 
achieved by assessing functionality, for example by linking the cloned segment with a coding 
sequence derived from a reporter gene and introducing the chimaeric construct into a host cell or 
cell-free system wherein expression of the reporter gene can be evaluated. This process may 
form part of another sequence isolation strategy termed promoter trapping, wherein genomic 



*~ ^ tot ° " eXPreSSi0n Ve ° ,0rS " ««■« a reporter gene coding 

regton and other sconces necessary for expression in a host oeU or cdl-free system The 
expresston may or may no, require integration of the chimeric instruct into the host's 
chromosomal DNA (Topping J F. e, aL (1991) and Topping JF. era/ (1994)) 

ide « fi ^ ""T" meth ° d ° f ° btaWngDNA "~ <>f «- Present invention is by me 
.d*foficauon and tsohttion of a DNA coding sequence which is known ,o he expressed and 
^sequentiy using this sequence to obtain me contiguous promoter sequenoe> ^ „ 
defauon dtrectmg me expression of me codmg sequence. A coding sequence may be obtain* 
by tte rsolahon of messenger RNA (mRNA orp„,yA + RNA) foom plan, tissue. The tissue used 
for RNA .solation is selected on me basis mat suitable gene coding sequences are behoved to be 
expressed in that tissue at optimal levels for isolation. 

Various mefoods for isolating mRNA from p.ant tissue are weli known fo those skilled in 
foe art, mclndmg for examp.e nsing an .Hgo-dT ohgonuCeotide unmobilised on an inert matrix. 
The .so.a.ed mRNA may be used fo produce its comptomentory DNA sequence (cDNA) by use 
of foe enzyme reverse transcriptase (RT) or ofoer enzymes having reverse tranche activity, 
isolation of an mdividua. cDNA sequence from a poo. of cDNAs may be achieved by cloning 
mfo bactena! or viral vectors, or by employing foe polymerase chain reaction (PCR) wifo 
selected ohgonucleotide primers. The ptoduction and isolation of a specific cDNA from mRNA 
may b^eved by a combination of foe reverse fransenption and PCR steps in a process known 

In order to isolate foe promoter sequence of foe present invention, a cDNA sequence to a 
Lnm. dexmnase inhibitor protein may be identified. Examples of ofoer Limit dextrinase inhibitor 
pmtont genes am given in Genbank Accession No.s X13443 for foe Hordeum ndgare alpha- 
am^sfo inhibitor; X99982. for foe Mtiam aesttvum cDNA for foe P OT88 protein . 

ZTf fo Z ^ ^ ° DNA &r * ^ 6C ' OT ******* 

X61032 for foe mncum ,un, m cDNA for foe alpha amyfase inhibitor protein; and X54064 for 

foe Zea mays cDNA for foe Hageman factor inhibitor protein 

Limit dexuinaseinhibitorproteins suitable for use in isolating the prmnoter of foe present 
mvention mclude proteins having foe amino acid sequence given as SEQ. ID. No;3 and proteins 
homologous to, and having essentially foe same bio.og.ca. properties as, foe protein disc.osed 
herem as SEQ. JO. No. 3. This definition is intended to encompass natnra. allelic variations in 
foe proton, pfos any non-aUehc examp.es. B wi.. be appreciated that foe amino acid sequence 
need not be tdentica. to foa. of SEQ. ID. No. 3; for foe pmposes of mis invention, foe amino acid 



sequence may be at least 80%, 90%, or 95% homologous or more with the protein of SEQ. ID. 
No:3 to retain its biological activity. General categories of potentially equivalent amino acids 
include, but are not limited to: glutamic acid and aspartic acid; lysine, arginine and histidine; 
alanine, valine, leucine and isoleucine; asparagine and glutamine; threonine and serine; 
phenylalanine, tyrosine and tryptophan; and glycine and alanine. 

The s elected cDNA m ay t hen b e u sed t o evaluate the g enomic features o f i ts g ene o f 
origin by use as a hybridisation probe in a Southern blot of plant genomic DNA to reveal the 
complexity of the genome with respect to that sequence. Alternatively, sequence information 
from the cDNA may be used to devise oligonucleotides and these may be used in the same way 
as hybridisation probes for PCR primers to produce hybridisation probes or for PCR primers to 
be used in direct genome analysis. 

Similarly the selected cDNA may be used to evaluate the expression profile of its gene of 
origin by use as a hybridisation probe in a Northern blot of RNA extracted from various plant 
tissues, or from a developmental or temporal series. Again sequence information from the 
cDNA may be used to devise oligonucleotides which can be used as hybridisation probes to 
produce hybridisation probes or directly for RT-PCR. 

The selected cDNA, or derived oligonucleotides, may then be used as a hybridisation 
probe to challenge a library of cloned genomic DNA fragments and identify overlapping DNA 
sequences. By this means a contiguous promoter may be identified and isolated. 

By the nature of the method of isolation, an isolated cDNA usually comprises the 3' 
terminus of the coding region and extends towards the 5' terminus. It may not comprise the full- 
length coding sequence. It is preferable to ensure that the 5' terminal sequence is present if the 
cDNA is to be used to identify the contiguous promoter. This may be achieved by extension of 
the cloned cDNA sequence in the 5' direction by a process termed 5' RACE. 

If sequence analysis of the cloned cDNA identifies a homologous sequence already 
reported in the scientific literature, this information may provide a suitable candidate sequence 
for the 5' terminus. However the possibility of there being different members of the same gene 
family with similar coding regions, but differing intron regions, promoter sequences and 
expression profiles may lead to the selection of an incorrect and unsuitable promoter sequence. 

Once the 5' terminus of the coding sequence has been identified, the contiguous upstream 
region containing the promoter may be isolated by further extension in the 5' direction. This 
may be achieved by methods including vector-ligation PCR, genome walking, vectorette PCR, 
and other methods. If necessary the process may be repeated with a new primer complementary 
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to the 5' terminus of the first promoter fragment to ensure that all the control sequences of the 
promoters are isolated. 

The present invention provides a DNA sequence which controls the expression of a Limit 
dextrinase inhibitor protein which DNA sequence is known herein as SEQ. ID. No. 1, and which 
may be used as a plant promoter. The present invention also includes within its scope DNA 
sequences which control the expression of other Limit dextrinase inhibitor protein genes and 
which are homologous with the DNA sequence of SEQ. ID. No. 1. 

Homology may be determined on the basis of percentage identity between two DNA (or 
polypeptide) sequences. "Percentage identity", as known in the art, is a measure of the 
relationship between two polynucleotides or two polypeptides, as determined by comparing then- 
sequences. In general the two sequences to be compared are aligned to give a maximum 
correlation between the sequences. The alignment of the two sequences is examined and the 
number of positions giving an exact nucleotide (or amino acid) correspondence between the two 
sequences determined, divided by the total length of the alignment multiplied by 100 to give a 
percentage identity figure. This percentage identity figure may be determined over the whole 
length of the sequences to be compared, which is particularly suitable for sequences of the same 
or very similar lengths and which are highly homologous, or over shorter defined lengths, which 
is more suitable for sequences of unequal length or which have a lower level of homology! 

For example, sequences can be aligned with the software clustalw under Unix which 
generates a file with the «.aln" extension, this file can then be imported into the Bioedit program 
(Hall, T.A. 1999. Bioedit: a user-friendly biological sequence alignment editor and analysis 
program for Windows 95/98/NT. Nucl. Acids. Symp. Ser. 41:95-98) which opens the .aln file. 
In the Bioedit window, one can choose individual s equences (two at a time) and align them. 
This method allows for comparison of entire sequences. 

Methods for comparing the identity of two or more sequences are well known in the art. 
Thus for instance, programs available in the Wisconsin Sequence Analysis Package, version 9.1 
(Devereux J. et al, Nucleic Acids Res. 12 387-395, 1984, available from Genetics Computer 
Group, Madison, Wisconsin, USA), for example the programs BESTFIT and GAP, may be used 
to d etermine the p ercentage i dentity b etween two p olynucleotides and the p ercentage i dentity 
between two polypeptide sequences. BESTFIT uses the "local homology" algorithm of Smith 
and Waterman (Advances in Applied Mathematics, 2 482-489, 1981) and finds the best single 
region of similarity between two sequences. BESTFIT is more suited to comparing two 
polynucleotide or two polypeptide sequences which are dissimilar in length, the program 
assuming that the shorter sequence represents a portion of the longer. In comparison, GAP 
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aligns two sequences finding a "maximum similarity according to the algorithm of Needleman 
and Wunsch (J. Molec. Biol. 48 443-454, 1970). GAP is more suited to comparing sequences 
which are approximately the same length and an alignment is expected over the entire length 
Preferably the parameters "Gap Weight" and "Length Weight" used in each program are 50 and 
3 for polynucleotide sequences and 12 and 4 for polypeptide sequences, respectively. 
Preferably, percentage identities and similarities are determined when the two sequences being 
compared are optimally aligned. 

Other programs for detennining identity and/or similarity between sequences are . also 
known in the art, for instance the BLAST family of programs (Karlin & Altschul, 1990, Proa. 
Natl. Acad. Sci. USA 87:2264-2268, modified as in Karlin & Altschul, 1993, Proc. Natl Acad. 
Sci. USA 90:5873-5877, available from the National Center for Biotechnology Information 
(NCB), Bethesda, Maryland, USA and accessible through the home page of the NCBI at 
www.ncbi.nlm.nih. ? ov ). These programs exemplify a preferred, non-hmiting example of a 
mathematical algorithm utilized for the comparison of two sequences. Such an algorithm is 
incorporated into the NBLAST and XBLAST programs of Altschul, et al., 1990, J. Mol. Biol. 
215:403-410. BLAST nucleotide searches can be performed with the NBLAST program, score 
= 100, wordlength = 12 to obtain nucleotide sequences homologous to a nucleic acid molecules 
of the invention. BLAST protein searches can be performed With the XBLAST program, score = 
50, wordlength = 3 to obtain amino acid sequences homologous to a protein molecules of the 
invention. To obtain gapped alignments for comparison purposes, Gapped BLAST can be 
utilized as described in Altschul et al., 1997, Nucleic Acids Res. 25:3389-3402. Alternatively, 
PSI-Blast can be used to perform an iterated search which detects distant relationships between 
molecules (Id.). When utilizing BLAST, Gapped BLAST, and PSI-Blast programs, the default 
parameters of the respective programs (e.g., XBLAST and NBLAST) can be used. See 
http://www.ncbi Jl lm.nih.gov. Another preferred, non-limiting example of a mathematical 
algorithm utilized for the comparison of sequences is the algorithm of Myers and Miller, 1988, 
CABIOS 4:11-17. Such an algorithm is incorporated into the ALIGN program (version 2.0) 
which is part of the GCG sequence alignment software package. When utilizing the ALIGN 
program for comparing amino acid sequences, a PAM120 weight residue table, a gap length 
penalty of 12, and a gap penalty of 4 can be used. 

Another non-limiting example of a program for determining identity and/or similarity 
between sequences known in the art is FASTA (Pearson W.R. and Lipman D.J., Proc. Nat. Acac. 
Sci., USA, 85:2444-2448, 1988, available as part of the Wisconsin Sequence Analysis Package). 
Preferably the BLOSUM62 amino acid substitution matrix (Henikoff S. and Henikoff J.G., Proc. 
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Nat. Acad. Sci., USA, 89:10915-10919, 1992) is used in polypeptide sequence comparisons 
including where nucleotide sequences are first translated into amino acid sequences before 
comparison. 

Yet another non-limiting example of a program known in the art for detemining identity 
and/or similarity between amino acid sequences is SeqWeb Software (a web-based interface to 
the GCG Wisconsin Package: Gap program) which is utilized with the default algorithm and 
parameter settings of the program: blosum62, gap weight 8, length weight 2. 

The percent identity between two sequences can be determined using techniques similar 
to those described above, with or without allowing gaps. In calculating percent identity, 
typically exact matches are counted. 

Preferably the program BESTFIT is used to determine the % identity of a query 
polynucleotide or a polypeptide sequence with respect to a polynucleotide or a polypeptide 
sequence of the present invention, the query and the reference sequence being optimally aligned 
and the parameters Of the program set at the default value. 

In the context of the present invention "substantially homologous" sequences are those 
which have at least 45% sequence identity, preferably at least 50% sequence identity, more 
preferably at least 60%, 70%, 80%, or 90% sequence identity and most preferably at least 95% 
sequence identity with the described sequences. In some cases sequence, identity may be 98% or 
more preferably 99%, or more. 

The present invention also includes DNA sequences which hybridize to the above 
isolated DNA, including partial sequences and complementary sequences. Conditions under 
which such sequences will so hybridize can be determined in a routine manner. For example, the 
present invention includes within its scope, DNA sequences which hybridise to the DNA 
sequences of the present invention under stringent conditions. In the context of the present 
invention, "stringent conditions" are defined as those given in Martin et al (EMBO J 
4:1625-1630 (1985)) and Davies et al (Methods in Molecular Biology Vol 28: Protocols for 
nucleic acid analysis by non-radioactive probes, Isaac, P.G. (ed) pp 9-15, Humana Press Inc., 
Totowa N.J, USA)). The conditions under which hybridization and/or washing can be carried out 
can range from 42°C to 68°C and the washing buffer can comprise from 0.1 x SSC, 0.5 % SDS 
to 6 x SSC, 0.5 % SDS. Typically, hybridization can be carried out overnight at 65°C (high 
stringency conditions), 60°C (medium stringency conditions), or 55°C (low stringency 
conditions). The filters can be washed for 2 x 15 minutes with 0.1 x SSC, 0.5 % SDS at 65°C 
(high stringency washing). The filters were washed for 2 x 15 minutes with 0.1 x SSC, 0.5 % 



SDS at 63°C (medium stringency washing). Forlw stringency washing, the filters were washed 
at 60°C for 2 x 15 minutes at 2 x SSC, 0.5% SDS. 

The present invention also includes DNA sequences which hybridize to oligonucleotide 
probes. Preferably the DNA sequences hybridize to oligonucleotide probes under stringent 
conditions. In instances wherein the nucleic acid molecules are oligonucleotides ("oligos") 
highly stringent conditions may refer, for example, to washing in 6x SSC / 0.05% sodium 
pyrophosphate at 37°C (for 14 base oligos), 48°C (for 17 base oligos), 55°C (for 20 base oligos) 
and 60°C (for 23 base oligos). 

The gene coding sequence employed in carrying out the present invention may be active 
m some or all plant tissues. The sequence employed may encode a protein or an RNA moiety 
Through recombinant DNA techniques the sequence may encode a synthetic variant of a protein 
or RNA, a partial sequence, or a composite sequence comprising regions from one or more 
genes. F or example, the chimaeric gene may encode a polyprotein. T he sequence may also 
comprise repeated, inverted or complementary sequences, to achieve disruption of the 
transcription and translation of an endogenous gene or genes by, for example, antisense or RNA 
inhibition technology. 

The sequence known herein as SEQ. ID. No. 2 includes a coding sequence therein for the 
gene encoding a limit dextrinase inhibitor protein. In addition, many plant, bacterial and viral 
genes may be actively expressed. Exemplary genes encode the GUS, GFP and luciferase 
enzymes which may be used as reporter genes for promoter function. In addition the LDI 
promoter may be used to direct the expression of DNA sequences designed to alter the 
metabolism of the endosperm of the seeds of cereal plants, especially barley. Such genes include 
those for carbohydrate metabolism, starch metabolism, amino acid and protein metabolism and 
lipid metabolism. 

The gene product of the present invention can be produced by recombinant techniques 
wherein genomic DNA clones or cDNA clones for the DNA coding sequence are produced 
isolated, proliferated, and incorporated into a plant transformation vector of the present 
invention. 



Plant transformation vectors 

Plant transformation vectors of the present invention will contain "expression cassettes- 
comprising 5>-3> in the direction of transcription, a promoter sequence as described in the 
present invention, a gene coding sequence as discussed above and, optionally a 3' untranslated 
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terminator, sequence including a stop signal for RNA polymerase and a polyadenylation signal 
for polyadenylase. 

The promoter sequence may be present in one or more copies, and such copies may be 
identical or variants of the promoter sequence as described above. S uch copies may also be 
complete or partial sequences as described above. 

The terminator sequence may be obtained from plant, bacterial or viral genes. Suitable 
tenninator sequences are the pea rbcS E9 terminator sequence, the nos terminator sequence 
derived from the nopaline synthase gene of Agrobacterium tumefaciens and the 35S terminator 
sequence from cauliflower mosaic virus, for example. A person skilled in the art will be readily 
aware of other suitable terminator sequences. 

The expression cassette may also comprise a gene expression enhancing mechanism to 
increase the strength of the promoter. An example of such an enhancer element is that derived 
from a portion of the promoter of the pea plastocyanin gene, and which is the subject of 
International Patent Application, Publication No. WO 97/20056. 

These regulatory regions may be derived from the same gene as the promoter DNA 
sequence of the present invention or may be derived from different genes, e.g. from Hordeum 
vulgare or other organisms. All of the regulatory regions should be capable of operating in cells 
of the tissue to be transformed. 

The gene coding sequence may be derived from the same gene as the promoter DNA 
sequence of the present invention or may be derived from a different gene, e.g. from Hordeum 
vulgare or another organism. 

The expression cassette may be incorporated into a basic plant transformation vector, 
such as, binary or super-binary vectors, for example, pBIN19 and pBIN+ and other suitable plant 
transformation vectors known in the art. 

In addition to the expression cassette, the plant transformation vector will contain such 
sequences as are necessary for the transformation process. These may include the 
Agrobacterium vir genes, one or more T-DNA border sequences, and a selectable marker or 
other means of identifying transgenic plant cells. 

Production of the gene product 

Expression of a DNA coding sequence in the plant host cell will produce an RNA 
transcript If the coding sequence is derived from a structural gene, the RNA transcript is then 
translated into a protein gene product. If desired, the gene product can be isolated by standard 
techniques for isolating proteins from biological systems, such as salt precipitation, column 
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chromatography, immunoaffinity techniques, electrophoresis, recrystallisation, centrifugation, 
and such like. 



Example 1. Isolation of barley limit dextrinase inhibitor cDNA fragments. 

Cloning ofLDIgene by RT-PCR 

Total RNA was extracted from barley (var. Golden Promise) grains 2 and 4 weeks post 
anthesis with a LiCl method as described by (Cathala et al., 1983). 

2 ug of RNA was treated with Rnase-free DNase I (Amersham Pharmacia Biotech) and 
used to synthesize 30 ul first strand cDNA using random hexamer primers (Roche) and M-MLV 
reverse transcriptase (Promega) using the reaction conditions recommended by the manufacturer. 

It will be recognised by one skilled in the art mat other mRNA extraction and cDNA 
synthesis methods exist which could be employed to produce cDNA from tissue of Hordeum 
vulgare. 

A 3 ul aliquot of the cDNA product was used in a standard PCR reaction containing 2 
mM MgCl 2 , 8% (v/v) DMSO and primers Inhib-5 (5'- 

ACCAATAAACTAGTATCAACAATGGCATCCGACCA-3' SEQ ID No 4) and Inhib-6 (5'- 
CCAACCTrTTTTATTCATCAATCGGCCACA-3' SEQ ID No 5), which were designed 
against the Hordeum vulgare Limit dextrinase inhibitor sequence (Genbank Accession no. 
X13443.), using Taq polymerase (Bioline) as recommended by the manufacturer. The PCR 
program used was 5 min at 94 °C, followed by 35 cycles of 94 °C for 30 sec, 63.5 °C for 30 sec 
and 72 <>C for 1 min, finalized by 7 min at 72 °C. The product length was 623 bp. The amplified 
product was cloned into a pGEM-T Easy vector (Promega) and verified by sequencing. 

It will be recognised by one skilled in the art that other methods exist which could be 
employed to produce cloned DNA fragments. 

The sequence of this cDNA clone is shown in SEQ ID No 2. Compared to the published 
sequence (Genbank Accession no. X13443.), there are five single base pair substitutions, which 
in turn lead to two amino acid substitutions. 

Example 2. RT-PCR analysis of Limit Dextrinase Inhibitor expression. 

Total RNA was isolated from barley tissues using a LiCl method (M.-J.Cho et al, 1998). 
2 ug of total DNAse treated RNA was used to produce cDNA with M-MLV reverse 
transcriptase. 1/10 of this cDNA was used to run a PCR reaction with specific primers for the 
entire coding region of LDI. 
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The expression of the LDI gene was investigated by using RT-PCR with the specific 
primers Inhib 5 (SEQ ID No. 4) and Inhib 6 (SEQ ID No. 5 ) on RNA from developing grains. 
Expression of LDI was found in grains 20 and 40 days post anthesis, the endosperm and 
gibbereUic acid treated aleurone layers. The highest level of expression was found in the 
endosperm. As shown in Figure 1 . 

Example 3. Northern analysis of Limit Dextrinase Inhibitor expression. 

Total RNA was isolated from developing and genninating barley grains, barley leaves in 
the light or dark, and barley roots using a LiCl method (M.-J.Cho et al., 1998). 5 ug of total 
RNA were separated on a 1 % agarose gel with formaldehyde. The RNA was blotted over night 
with 1 Ox SSC onto a positively charged Nylon membrane and UV crosslinked. The blot was 
then prehybridized in DIG-EasyHyb and hybridised with 100 ng/mlDIG-labeled RNA probe at 
68 °C over night. The blot was washed once in 2x SSC, followed by 2 washes with 0.2x SSC at 
68 °C. After that the blot was blocked for 1 hour at RT in blocking reagent. This was followed 
by 30 minutes in 1:10,000 dilution of anti-DIG antibody solution. To remove excess antibody 
the blot was washed 4x in Tris buffer with salt, then equilibrated in phosphatase buffer and the 
chenuluminiscent substrate (CDP-star) added. Chenfiluminescence detection using X-ray film 
was performed. Northern blotting and hybridisation was carried out as detailed by manufacturers 
of the DIG label (Roche). 

Expression of LDI was only detected in early development of barley grains (2 and 4 
weeks post anthesis). No expression was found in germinating grains and vegetative tissue as 
shown in Figure 2. 

Example 4. Southern blotting 

Genomic DNA of barley leaf tissue was isolated according to the method of 
AW.MacGregor et al, 1995 and S.L.Delaporta et al, 1983. 10 ug digested genomic DNA 
(isolated using the Ambion Phytopure Kit according to manufacturer's instructions) was 
separated on a 0.8 % TAE agarose gel at 20V over night. The DNA was depurinated in 250 mM 
HC1 for 10 minutes, denatured for 30 min in 0.4 M NaOH and neutralized before blotting for 8 
hours onto a positively charged Nylon membrane (Roche) in 20x SSC. The DNA was UV 
crosslinked and prehybridized in DIG-Easyhyb for 1 hour at 52 °C and hybridised with 40ng/ml 
DIG labeled DNA probe at 52 °C over night. Washing and detection was carried out as described 
before for Northern blotting except for a more stringent wash with 0. 1 x SSC. 
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Southern blot analysis was used to exarnine the number of gene copies in wildtype (wt) 
"Golden Promise" barley. The exact number of copies of LDI in wildtype "Golden Promise- 
barley is hard to estimate as several bands per digest are visible. The most likely explanation is 
that there is a small gene family of related genes present in barley, as shown in Figure 3. 

Example 5. Isolation of the limit dextrinase inhibitor promoter region by genome walking. 

Genomic DNA of barley leaf tissue was isolated according to the method of 
A.W.MacGregor etal., 1995 and S.L.Dela P orta et al., 1983. 2.5 ug genomic barley DNA (variety 
"Golden Promise") was digested with the restriction enzymes Sea I, Pvu n, EcoR V, Sma I or 
Dra I respectively. These DNA fragments were then ligated to an asymmetric adapter and used 
as a template for PCR reactions with adapter primers and gene specific primers to isolate the 5' 
region of the LDI gene. All procedures were carried out as described in the Clontech manual. 
The first PCR was carried out using primers API (SEQ ID No. 6 ) and GSP1 (SEQ ID No. 7 ). 
lul of this 50 ul PCR reaction was used as template for the secondary (nested) PCR using 
primers AP2 (SEQ ID No. 8 ) and GWl (SEQ ID No. 9 ). 

Using the procedures described above, two candidate fragments were isolated, from 
digests with Sma I and Pvu H, as shown in Figure 4. The DNA fragments from the Sma I (-1000 
bp) and Pvu H (~ 650 bp) digests were purified and cloned into the Topo 4 vector and sequenced. 
Both fragments (Sma I and Pvu H) showed the same A and T rich sequence, spanning the 790 bp 
long 5 ' region of LDI. The promoter sequence is shown in SEQ ID No. 1 . S EQ. ID. No. 1 
discloses a nucleic acid sequence of 833 bp, within which sequence there is the 790 bp promoter 
sequence of the present invention. The sequence also contains therewithin the 5' untranslated 
region of the gene coding sequence, up to and including the first codon of the gene. 

Example 6. Transient Expression of Limit Dextrinase Inhibitor Promoter marker gene 
constructs. 

In order to investigate if the 790 bp region upstream of the LDI gene was sufficient as a 
promoter, a vector containing the putative promoter region plus a reporter gene was constructed. 
It consists of the 790 bp Sma I fragment, the GFP gene (green fluorescent protein) and a 
terminator (NOS). This construct (Figure 5) was used for transient assays on barley endosperm 
(2-4 weeks post anthesis) and for stable barley transformation. Endosperms were transformed 
with a biolistic device using .construct DNA coated gold particles and analysed by fluorescence 
microscopy after 24 and 48 hours post bombardment. The results in shown in Figure 6 
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demonstrate the expression of GFP, showing that the isolated genomic sequence acts as 
promoter. 



Example 7. Stable expression of Limit Dextrinase Inhibitor Promoter marker gene 
constructs. 

Stable barley transformation of immature embryos of the barley variety "Golden 
Promise" was performed with a biolistic device using DNA coated gold particles. Selection and 
regeneration procedures were performed as described by Y.Wan & P.GXemaux, 1994. 
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SEQUENCE LISTING 

<110> Advanced Technologies (Cambridge) Ltd. 

<120> Limit Dextrinase Inhibitor Promoter 

<130> RD-ATC-33 

<140> 
<141> 

<160> 9 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 833 
<212> DNA 

<213> Hordeum vulgare 
<400> 1 

atcaSaact tatS??^ 9Cgtggtcga osgcecggga tgtf ttgga caccaaatgt 60 
atcataaact tgttttttca ccgacaaaat attgctcctc catttcgcat taaaattatc i 5 n 

taStStt SES*? -^aacatt cataaaaaaa atattt?tta aglaaacltt III 
tccaatatac ccKSf tcatctggga gcatgtgctt ccggaagcca aaatgcccct 240 
cataa^acat ttafS** agaaacccct tctttcctaa aaatatatat catcgtccgt 300 
catgatacgt ttatgtattc aacgaaaaat attttcgcat gtcaccaaaa atattttata 3 so 
aa^tSaca T*™*^ gataaactcc ctcgtgttaa ctattttttc" tg? 420 
aaggatgaca atcaaaacaa aaatgtagac tgtaaacaaa gaaaacatta tttcctaaaa III 
ataaaaaaaa agattagagg gatatgtatt gtcgaaacac atgaggacta" gaJcaaJKa III 
aaaagggaaa tgagaaggaa aaaaggggta accattaccc aaagalaaca giaaatSfc 600 
tagacgtgtc gaagggaaac ggagtttgca ggggcgttcc aaa?tcagtt gcaafalcSt 660 
ccaaataaac gccaacaaga aagaaatgag cattacttgc gcgctttgca ctcKatctc ill 
tagcatctcc cgatacatac atacatgtag cctagctgca gaLttgJat agcStctt ill 
gcccaccagg ccaagagatt gaaccaacga ccaataaact agtatcaaca ^g ttCtt ™° 

<210> 2 
<211> 517 
<212> DNA 

<213> Hordeum vulgare 

<220> 

<221> CDS 

<222> (14) . . (457) 

<400> 2 

actagtatca aca atg gca tec gac cat cgt cgc ttc gtc etc tec ggc 49 

Met Ala Ser Asp His Arg Arg Phe Val Leu Ser Gly 
1 5 10 



111 v!? f 9 r ^ Cg 9tC ° tC 9CC gtc 9 CC 9 CC 9 CC acc ctg gag age 97 
Ala Val Leu Leu Ser Val Leu Ala Val Ala Ala Ala Thr Le2 111 sS 

15 20 25 

£? ff 9 ? aC Hf 9 t9C Caa cca g 99 9 fc 9 9 ac fct ° =cg cat aac ecg tta 145 
Val Lys Asp Glu Cys Gin Pro Gly Val Asp Phe Pro His Asn Pro LeS 

lit ?hr S£ utf £° I 3C 9tg ^ ta 333 C " 9tC tgC g 9 c C 9 C 99* <=cc 193 
Ala Thr Cys His Thr Tyr Val He Lys Arg Val Cys Gly Arg Gly Pro 

50 55 60 



v 1 

= a = a 2 r a ! s h si 5 s s s a s s 241 



65 70 



75 



SS Pro SE ?° ^ 9 t9C 9a9 9Cg Ctg * tc etc atg gac ggg 289 

Val Pro Asp His Cys Arg Cy S Glu Ala Leu Arg He Leu Met Lp g? 

85 90 

S3 s S S Su If? fg r a ! ss ss *s S 35 Sp SI " 7 

100 105 

cgt gac tgc ccg agg gag gag cag agg gcg ttc gee qcc aca etei- a*** ,»«: 
Arg Asp cys Pro Arg Glu Glu Gin *g fla Phe Ala £ £* £eu 



acg gcg gcg gag tgc aac eta teg tec gtc cag gag ccg gga gta coc 433 
Thr Ala Ala Glu Cys Asn Leu Ser Ser ?al Gin Glu Pro Gly ?2 Sg 

. 130 135 14u 

Hi ?af Leu Leu S Up Sy ^ 3 cgccaaggt aatgaagegg 487 



145 

agtactgtat acagaataaa agtactcgag 

<210> 3 
<211> 147 
<212> PRT 

<213> Hordeum vulgare 

<400> 3 
Met 
1 



517 



Ser 


Asp His 
5 


Arg 


Arg 


Phe 


Val 


Leu 


Ala Val 


Ala 


Ala 


Ala 


Thr 




20 








25 


Pro 


Gly Val 


Asp 


Phe 


Pro 


His 


35 






40 




Val 


He Lys 


Arg 


Val 


Cys 


Gly 








55 






Lys 


Glu Arg 


Cys 


Cys 


Arg 


Glu 






70 








Cys 


Glu Ala 


Leu 


Arg 


He 


Leu 




85 








Arg 


Val Val 


Glu 


Gly 


Arg 


Leu 




100 






105 


Glu 


Gin Arg 


Ala 


Phe 


Ala 


Ala 


115 








120 




Leu 


Ser Ser 


Val 


Gin 


Glu 


Pro 








135 






Gly 













10 15 
jeu Glu Ser Val Lys Asp 
30 

Isn Pro Leu Ala Thr Cys 
45 

vaj. ne Lys Ara Val Cvs Glv lr« nixr t 

50 « 60 

vaj » Arq Cvs Cvs &ra filn t. &1 i si. i 

65 *Tn ^ c 

75 80 
4et Asp Gly Val Arg Thr Pro 
90 95 
Sly Asp Arg Arg Asp Cys Pro 
110 

Thr Leu Val Thr Ala Ala Glu 
125 

JJCU oer ser val Gin Glu Pro Gly Val Aro 

130 145 
Asp <j±y 

145 



<210> 4 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: PGR primer 
<400> 4 

accaataaac tagtatcaac aatggcatcc gacca 

<210> 5 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR primer 
<400> 5 

ccaacctttt ttattcatca atcggccac 

<210> 6 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR Primer 
<400> 6 

gtaatacgac tcactatagg gc 

<210> 7 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR primer 
<400> 7 

agacccgttt tatcacgtag gtgtgg 

<210> 8 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR primer 
<400> 8 

actatagggc acgcgtggt 

<210> 9 
<211> 26 
<212> DNA 

<213> Artificial Sequence 



35 



29 



22 



26 



19 



<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 9 

agacggcgcc ggagaggacg aagcga 
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Figure 1. RT-PCR analysis of barley limit dextrinase inhibitor expression. 



Figure 2. Northern blot analysis of limit dextrinase expression. 
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Figure 3. Southern blot analysis of limit dextrinase inhibitor in barley genomic DNA. 
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Figure 4. Isolation of limit dextrinase inhibitor promoter fragments by genome walking PCR. 
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Figure 5. Barley transformation construct containing the limit dextrinase inhibitor promoter. 
£coM - 10579 




„ 4/21 

Figure 6. Transient expression assays using the limit dextrinase promoter: GFP construct. 
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Figure 7. 



Start 


End 


Score 


247 


297 


0.89 


273 


323 


0.95 


652 


702 


0.83 



Promoter Sequence 

atgccccgtgtaaaagaaaccccttctttcctaaaaatatAtatcatcgt 
tttcctaaaaatatatatcatcgtccgtcatgatacgtttAtgtattcaa 
caagaacctccaaataaacgccaacaagaaagaaatgagcAttacttgcg 
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Figure 8 



Sequences producing significant alignments: 

gi 1 18868 | emb (X13443.1 1 HVAATI Barley mRNA for alpha -amylase/ . . . 

gi|22035404|gb|AY124482.l| Danio rerio myogenin gene, exons... 

gi| 24614584 |gb|AY050653.l| Giardia intestinalis SEC24 (SEC2 
gi|l0938038|gb|AF162890.l|AF162890Sl Mus musculus peroxisom! ! ! 



Score 



gi|5821237|dbj |AB021922.1 
gi I 3721562 | dbj | AB011276 . 1 



gi 
gi 
gi 
gi 
gi 
gi 
gi 
gi 
gi 
gi 



30313388 | gb | AY099112 
18140057 jgb | AF457660 
4104807 
8050595 
6690643 
6164589 
3643823 _ 

19919401 1 gb | AF435445 
8132114 |gb|AF153014 . 
4456992 |gb|AF077743 • 



•11 
.11 



Homo sapiens gene for lectin- lik. 
Mus musculus gene for alphala ca. 

Rattus norvegicus obese protein . 



gb 


AF039526 


.1 


AF039526 


gb 


AF233737 


.1 


AF233737 


gb 


AF191544 


.1 


.AP191544 


gb 


AF051769 


.1 


AF051769 


gb 


AF075270 


.1 


AF075270 



Homo sapiens MHC class I . 
Agrotis ipsilon AiC5 chym. 
Homo sapiens estrogen rec. 
Homo sapiens hyaluronidas . 
Hordeum vulgare high affi. 
1| Pleurotus ostreatus manganese pe. 
1| Trichomonas vaginalis Tvpl4 (tvpl. 
1|AF077743 Mus musculus transcriptio. 

gi|l2276207|gb|AF269146.l|AF269146 Bilophila wadsworthia ta. 
gi|14275833|emb|AJ289605.l|MMU289605 Mus musculus partial L. 

gi|3378604|emb|AJ009889,l|HSAJ9889 Homo sapiens GHR gene V 
gi|391623l|gb|AF074397.l|AF074397 Homo sapiens anti-mulleri 
gi|513 9506|emb|Z!88 92.2|MMDESMINP Mus musculus desmin gene 

gi 1 15487305 |dbj |AB060299.l| Mus musculus gene for acetyl Co. 



gi 

gi 
gi 
gi 
gi 
gi 
gi 
gi 
gi 

gi 
gi 
gi 
gi 
gi 

gi 
gi 
gi 



12697590 
25453365 
13183059 
13625520 
17105179 
16209547 
15213480 
15558849 



dbj 

gb 
gb 

gb 

gb 
gb 
gb 



AB046716 
AY050651. 
AF237414. 
AY014277. 
AF439558. 
AY052528. 
AF241535. 



-I Homo sapiens hST3Gal I gene for. 
Giardia intestinalis MYB (MYB) m. 
Anaplasma phagocy tophi lum RNA po. 
Lolium perenne gibberellin 2 0 -ox. 
AF439558 Mus musculus X2CR1 gene, . 

Glycine max calmodulin isof orm-4 . 
AF241535 Homo sapiens mucin 4 (MU. 
emb|AJ310936.l|FSO310936 Fusarium solani chy ge. 
4878023|gb|AF131239.2|AF131239 Rattus norvegicus alpha 1. 

15216031 1 emb | AJ318812.1 1 VFA3 18812 Vicia faba var. minor 
4454294 ' " 
3133088 
6690521 
5731977 



emb|AJ132779.l|MMU132779 Mus musculus pl07 gene 
emb|Y16736.l|HSA16736 Homo sapiens dif-2 gene, p! 
gblAFl54245.l|AF154245 Rattus norvegicus chemota, 
gb|AF114032 .1|AF114032 Mus musculus glycogenin-l , 



26453412 |dbj | AB094665.1 | Seriola quinqueradiata YGHL1 ge 
4039145|gb|AF099083.l|AF099083 Homo sapiens growth hormo. . 
2739123 |gb|AF029214.l|MMOX2Sl Mus musculus MRC OX-2 anti 

gi|2895903|gb|AF046916.l|AF046916 Ruminococcus flavefaciens 
gi|l916583|gb|U53907.l|RNU53907 Rattus norvegicus microsate.. 



(bits) 


Value 


82 


2e-12 


80 


6e-12 


76 


le-10 


76 


le-10 


76 


le-10 


74 


4e-10 


72 


2e-09 


72 


2e-09 


72 


2e-09 


72 


2e-09 




2e-09 


72 


2e-09 


72 


2e-09 


70 


6e~09 


70 


6e-09 


70 


6e-09 


70 


6e-09 


70 


6e-09 


7ft 


C a. AO 

be-09 


. 70 


6e-09 


70 


6e-09 


70 


6e-09 


i u 


oe-09 


68 


2e-08 


68 


2e-08 


68 


2e-08 


o o 


^5 tJ \J O 


68 


2e-08 


68 


2e-08 


68 


2e-08 


68 


2e-08 


68 


2e-08 


68 


2e-08 


68 


2e-08 


68 


2e-08 


68 


2e-08 


68 


2e-08 


68 


2e-08 


68 


2e-08 


68 


2e-08 


68 


2e-08 
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9i|l334579 2 |gb|AP33275 9 .l| Homo sapiens partially duplicate 

»«?s:isgsss:i ?i:s: a % r enta^ 

emb|AJ277249.l|H S A277249 Homo sapiens p2Sa^! 
gi|l518845|gb|U63899.l| M MU63899 Mus musculus Girk2 gene, pr. 
gi|7109 2 84|gb| A P227507.l| Rattus norvegicus intestinal alka. 
gill2744733lgblAF318503.ll Danio rerio Myod (myod) gene, co. 

gi 
gi 
gi 
gi 



gi 
gi 
gi 
gi 
gi 
gi 
gi 
gi 



2581766 
2764739 
7579914 



14251200 I gb 
14043019 | gb 
2342636 I erab 



AF220499,2 AP220499 Acidithiobacillus ferroo* 
AF221946 ; 2 |AP221946 -Rickettsia- rickettsii ce' 
YlieSB.llMMyneas M.musculus CYP4A14 gene , e '. 

gi j 18873678 |emb | AJ2725'07.1|HSA272507 
gi I 2564335 | dbj | AB008218 . 1 1 AB008218S1 



Homo sapiens partial K. 
Homo sapiens gene for . 



gi 
gi 
gi 



gi 
gi 
gi 
gi 
gi 
gi 
gi 
gi 
gi 



18413572 
18873687 
29648446 
22651882 
22347797 
11230634 



emb | AJ428930 . 1 |XLA428930 
emb | AJ272516 . 1 |HSA272516 



gb 
gb 
gb 



AY190007.1 
AF291761.1 
AF532732.1 



Xenopus laevis partial . . 
Homo sapiens partial K. 



Pan paniscus clone BoE26-M13R hi. . 
Ipomoea batatas S-adenosylmethio! 
am i,i S T, 00 ,.." i Danio r erio mx gene, promoter se. ! 
emb|AJ289159.1 HSA289159 Homo saniens cmn o»t,« 
3342093|gb|AP074905.l|HOMOS L C04 Homo sapiens neurWHS* " 
^^loot 1 ^ 1 ^ 325198 - 1 !^ 325198 Triticum aestr^m ££i4 
6649909|gb|AF026274.1|AP026274 Mus musculus SumTko ^umi* ' 



gi|622479l|gb|AF190816.1|AF190816 Homo sapiens complement f... 
^^tofi 9b| ^ 035664 ' l|H f TGFRB13 Homo sa P^ns transformin 



gi 
gi 
gi 
gi 
gi 
gi 

gi 
gi 
gi 



21213045 
15638904 
15638822 
13793999 
11093799 



emb|AJ487974 .1|EAM487974 Enterobacter^igenus*: 
Homo sapiens BAC clone RP11-6430. 
Homo sapiens BAC clone RP11-345m! 
Pinus taeda isolate PTLTP3-iii27 * 
AF285184 Mus musculus basic trans! 

2984654|gb|AP039088.l|AP039088 Homo saoiens non 



gb 
gb 
gb 
gb 



AC024246.8 
AC079120.6 
AY02 9002.1 
AF285184.1 



68 


2e-08 


66 


le-07 


66 


• le-07 


66 


le-07 


66 


le-07 


66 


le-07 


64 


4e-07 


64 


4e-07 


64 


4e-07 


64 


4e-07 


64 


4e-07 


64 


4e-07 


62 


2e-06 


60 


6e-06 


60 


6e-06 


60 


6e — 06 


60 


6e-06 


60 


6e-06 


60 


6e-06 


60 


6e-06 


58 


2e-05 


58 


2e-05 


58 


2e-05 


56 


9e-05 


56 


9e-05 


56 


9e-05 


54 


4e-04 


54 


4e-04 


54 


4e-04 


52 


0.001 


52 


0.001 


52 


0.001 


52 


0. 001 


50 


0.006 


48 


0.023 


48 


6.023 


48 


0.023 


48 


0.023 


48 


0.023 


46 


0.090 


46 


0.090 


46 


0 . 090 


46 


0.090 


4€ 


0.090 


44 


0.36 


44 


0.36 
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gi|2281684|gb|AF009433.l|AH006708S06 Homo sapiens clone 22 



91 

gi 
gi 
gi 
gi 
gi 
gi 



13936217 
13936206 
13936204 
13794006 
13794003 
24210408 
13928026 



|gb AY029021.1 
jgb AY029014.1 
jgb AY029013.1 
jgb AY029007.1 
|gb AY029005.1 
|emb|AJ320160.i 
|emb|AL121594.6 



Pinus radiata isolate PRLTP3-iii. 
Pinus taeda isolate PTLTP9-iiiis . 
Pinus taeda isolate PTLTP4-i360 
Pinus taeda isolate PTLTP3-v94 n. 
Pinus taeda isolate PTLTP6-1425 
|FNU32O160 Pusobacterium nucleatu! 
(CNS01DRY Human chromosome 14 DNA. 



gi|8191116|gb AC040163.3 
gi j 6707080 jgb AF139182.1 
gij 4028938 jgb AC004230.1 
gi | 21623971 1 dbj |AP001094 
gi 1 19879812 j dbj j AP001363 



AC040163 Homo sapiens chromosome 1. 

AF13 9182 Bartonella henselae filam. 

AC004230 Homo sapiens Chromosome 1. 

.6 1 Homo sapiens genomic DNA, chrom. 

.4 1 Homo sapiens genomic DNA, chrom. 

Alignments 



44 


0 


.36 


44 


0 


.36 


44 


0 


.36 


44 


0 


.36 


44 


0 


.36 


44 


0 


.36 


44 


0 


-36 


44 


0 


.36 


44 


0 


.36 


44 


0 


.36 


44 


0 


.36 


44 


0 


.36 


44 


0 


.36 



>gi | 18868 | emb |X13443 . 1 |HVAATI 
inhibitor 



Barley mRNA for alpha- amylase/ trypsin 



Length 



672 



Score =81.8 bits (41), Expect 
Identities m 41/41 (100%) 
Strand = Plus / Plus 



2e-12 



Query: 793 aagagattgaaccaacgaccaataaactagtatcaacaata 833 

sbict x MIIIIMIII Illllllllllllllllllll 

strjct: 1 aagagattgaaccaacgaccaataaactagtatcaacaatg 41 



>gi|22035404|gb|AY124482.l| Danio rerio myogenin gene, exons 1, 2 and 3 

and complete cds 

Length =4260 

Score = 79.8 bits (40), Expect = 6e-12 
Identities = 40/40 (100%) 
Strand = Plus / Minus 

Query: 3 tcgattactatagggcacgcgtggtcgacggcccgggctg 42 

Sbict- 4246 U'lU'lll'IIIIIINIIMIIIIIIIIIIIIIIIII 
SD^ct. 4246 tcgattactatagggcacgcgtggtcgacggcccgggctg 4207 



>gi | 24614584 |gb | AY050653.il Giardia intestinal is SEC24 (SEC24) mRNA, 
complete cds 

Length « 1420 

Score = 75.8 bits (38), Expect = le-10 
Identities = 38/38 (100%) 
Strand = Plus / Minus 

Query: 5 gattactatagggcacgcgtggtcgacggcccgggctg 42 

Sbict- 14X7 ■ ■ A ■ lllllllllllllllllllllillllllllllll 

& ^ ' 1417 g attac tatagggcacgcgtggtcgacggcccgggctg 13 80 
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Score = 67.9 bits (34), Expect » 2e-08 
Identities « 34/34 (100%) 
Strand = Plus / Plus 



Query: 9 actatagggcacgcgtggtcgacggcccgggctg 42 

, . h , .iiiiiiiiiiiiiiiiiiiiiiiiiiniiiiii 

Sbjct: 23 actatagggcacgcgtggtcgacggcccgggctg 56 



>gi|l093.8038|gb|AF162890.l|AP162890Sl Mus musculus peroxisomal assembly 
protein PEX3P (Pex3) gene, 

promoter and exon 1 

Length « 2785 

Score =75.8 bits (38), Expect = le-10 
Identities a 38/38 (100%) 
Strand = Plus / Plus 

Query: 5 gattactatagggcacgcgtggtcgacggcccgggctg 42 

minim iiiiiimiifiiiiiiiiiii 

Sbj ct: 13 gattactatagggcacgcgtggtcgacggcccgggctg 50 



>gi | 5821237 |dbj|AB021922.l| Homo sapiens gene for lectin-like oxidized LDL 
receptor, promoter 
region 

Length = 2463 

Score « 75.8 bits (38), Expect = le-10 
Identities = 38/38 (100%) 
Strand ■ Plus / Plus 

Query: 5 gattactatagggcacgcgtggtcgacggcccgggctg 42 

OK' , „ "UHIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

SJrjct: 13 gattactatagggcacgcgtggtcgacggcccgggctg 50 



>gi 1 3721562 |dbj |AB011276.l| Mus musculus gene for alphala calcium 

channel, partial cds 

Length = 2099 

Score m 73.8 bits (37), Expect = 4e-10 
Identities a 40/41 (97%) 
Strand « Plus ./ Plus 

Query: 9 actatagggcacgcgtggtcgacggcccgggctgttattgg 49 

lllllllllllllllllllllllllillllllli Mil 

Sb}ct: 1 actatagggcacgcgtggtcgacggcccgggctggtattgg 41 
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>gi|30313388|gb|AY099112.l| Rattus norvegicus obese protein gene, 5 
flanking region and 

partial cds 

Length - 1127 

Score - 71.9 bits (36), Expect = 2e-09 
Identities = 36/36 (100%) 
Strand « Plus / Plus 

Query: 7 ttactatagggcacgcgtggtcgacggcccgggctg 42 

iiiiiiiiiiniiiiiiiiiiiiiiiiiiiiiiii 

SbDct: 17 ttactatagggcacgcgtggtcgacggcccgggctg 52 



>gi|l8140057|gb|AF457660.l| Castanea dentata clone ACS 2 vascular protein 
gene, promoter 

region 

Length = 685 

Score = 71.9 bits (36), Expect « 2e-09 
Identities = 36/36 (100%) 
Strand = Plus / Plus 

Query: 7 ttactatagggcacgcgtggtcgacggcccgggctg 42 

mi minium! mini in mm 

Sbjct: 9 ttactatagggcacgcgtggtcgacggcccgggctg 44 



>gi|4104807|gb|AF039526.l|AF039526 Homo sapiens MHC class I related protein 
1 (MR1) gene, partial 
cds 

Length =1423 

Score = 71.9 bits (36), Expect = 2e~09 
Identities = 36/36 (100%) 
Strand « Plus / Plus 

Query: 7 ttactatagggcacgcgtggtcgacggcccgggctg 42 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiini 

Sbjct: 4 ttactatagggcacgcgtggtcgacggcccgggctg 39 



>gi|8050595|gb|AP233737.l|AF233737 Agrotis ipsilon AiC5 chymotrypsinogen 
gene, "promoter region 
Length = 951 

Score = 71.9 bits (36), Expect = 2e-09 
Identities = 36/36 (100%) 
Strand « Plus / Plus 

Query: 7 ttactatagggcacgcgtggtcgacggcccgggctg 42 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiin 

Sb^ct: 11 ttactatagggcacgcgtggtcgacggcccgggctg 46 
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>gi|6690643|gb|AF191544.l|AF191544 Homo sapiens estrogen receptor beta 
gene, promoter region and p Deta 

partial cds 

Length =2495 

Score = 71.9 bits (36), Expect = 2e-09 . 
Identities = 39/40 (97%) 
Strand = Plus / Plus 

Query: 9 actatagggcacgcgtggtcgacggcccgggctgttattQ 48 

^ 1 1 HI! 1 1 III I III II llll II II II I III III Mill 

strict: 1 actatagggcacgcgtggtcgacggcccgggctggtattg 40 



>gi 1 6164589 |gb|AF051769.l|AF051769 Homo sapiens hyaluronidase-like 
pseudogene 1 (HYALP1) , partial sequence 
Length = 11957 

Score = 71.9 bits (36), Expect = 2e-09 
Identities = 36/36 (100%) 
Strand = Plus / Minus 

Query: 7 ttactatagggcacgcgtggtcgacggcccgggctg 42 

sb-ic t . IIHIINIIIIIIIIIIIIIIIIMIIIIIIIIII 

strjct. 11954 ttactatagggcacgcgtggtcgacggcccgggctg 11919 



promoter region, 5'UTR, and partial cds 
Length = 1094 

Score « 71.9 bits (36), Expect = 2e-09 
Identities = 36/36 (100%) 
Strand = Plus / Plus 

Query: 7 ttactatagggcacgcgtggtcgacggcccgggctq 42 

sh i Ct iiiiiiiiiiiiiiiiiiiiiiiiiiiiniiiiii 

oay cc : 64 ttactatagggcacgcgtggtcgacggcccgggctg 99 



>gi|l991940l|gb|AF435445.l| Pleurotus ostreatus manganese peroxidase (mnp3) 
gene, promoter r 

region and partial cds 
Length a 2790 

Score = 69.9 bits (35), Expect - 6e-09 
Identities = 35/35 (100%) 
Strand « Plus / Plus 

Query: 8 tactatagggcacgcgtggtcgacggcccgggcta 42 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

SJD}ct: 79 tactatagggcacgcgtggtcgacggcccgggctg 113 
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>gi|8132114|gb|AP153 014.l| Trichomonas vaginalis Tvpl4 (tvpl4) gene, 
complete cds 

Length = 1463 

Score « 69.9 bits (35), Expect = 6e-Q9 
Identities = 35/35 (100%) 
Strand = Plus / Plus 



Query: 8 tactatagggcacgcgtggtcgacggcccgggctg 42 

Mill I lllllllll lllllllllllllllll III 

Sbjct: 11 tactatagggcacgcgtggtcgacggcccgggctg 45 



>gi| 4456992 |gb|AF077743.l|AF077743 Mus musculus transcription factor TFEC 

gene, promoter region and 
.5' UTR 

Length « 615 

Score ■ 69.9 bits (35) , Expect = 6e-09 
Identities « 38/39 (97%) 
Strand = Plus / Plus 

Query: 9 actatagggcacgcgtggtcgacggcccgggctgttatt 47 

Mill III IIIIN Mill III IMIIMIIill llll 

Sb j ct : 1 act atagggcacgcgtggt cgacggcccgggctggtatt 3 9 



>gi|l2276207|gb|AF269146.l|AF269146 Bilophila wadsworthia taurine : pyruvate 
aminotransferase gene, 

complete cds 
Length » 2050 

Score = 69.9 bits (35), Expect = 6e-09 
Identities = 35/35 (100%) 
Strand = Plus / Minus 

Query: 8 tactatagggcacgcgtggtcgacggcccgggctg 42 

llllllllllll Mill IMIIMIIII MM Ml 

Sbjct: 2050 tactatagggcacgcgtggtcgacggcccgggctg 2016 



>gi|l4275833|enib|AJ289605.l|MMU289605 Mus musculus partial Lancll gene 

for LanC-like protein 1, exon 4 
Length = 682 

Score « 69.9 bits (35), Expect = 6e-09 
Identities « 35/35 (100%) 
Strand = Plus / Plus 



Query: 9 actatagggcacgcgtggtcgacggcccgggctgt 43 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 

Sbjct: 1 actatagggcacgcgtggtcgacggcccgggctgt 35 
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Score = 67.9 bits (34), Expect « 2e-08 
Identities = 34/34 (100%) 
Strand = Plus / Plus 



Query: 9 actatagggcacgcgtggtcgacggcccgggctg 42 

MIMM I MM MIM I lllllll I II III II I 

Sbjct: 46 actatagggcacgcgtggtcgacggcccgggctg 79 



>gi 1 3378604 | emb | AJ009889 . 1 1 HSAJ9889 Homo sapiens GHR gene, VI promoter 
region 

Length = 1640 



Score = 69.9 bits (35), Expect = 6e-09 
Identities = 38/39 (97%) 
Strand = Plus / Plus 



Query: 9 actatagggcacgcgtggtcgacggcccgggctgttatt 47 

MIMM IIMIIIMII IMIIMIMI Mill Mil 

Sbjct: 1 actatagggcacgcgtggtcgacggcccgggctggtatt 3 9 



>gi| 3916231 |gb|AF074397.l|AF074397 Homo sapiens anti-mullerian hormone type 
II receptor (AMHR2) 

gene, promoter region and partial cds 

Length « 1135 



Score = 69.9 bits (35), Expect = 6e-09 
Identities « 38/39 (97%) 
Strand = Plus / Plus 



Query: 9 actatagggcacgcgtggtcgacggcccgggctgttatt 47 

MMIIIIIMIMII Mill MMMM MM I Mil 

Sbjct: 13 actatagggcacgcgtggtcgacggcccgggctggtatt 51 



>gi|5139506|eTtibjzi8892.2|MMDESMINP Mus musculus desmin gene 

Length = 19391 

Score = 69.9 bits (35), Expect = 6e-09 
Identities = 35/35 (100%) 
Strand = Plus / Minus 



Query: 8 tactatagggcacgcgtggtcgacggcccgggctg 42 

Mlllllllllll llllllllllllllllllllll 

Sbjct: 19391 tactatagggcacgcgtggtcgacggcccgggctg 19357 



14/21 

>gi 1 15487305 |dbj|AB060299.l| M us musculus gene for acetyl CoA 

synthethase-1, promoter region, 

partial sequence 

Length = 2094 

Score = 69.9 bits (35), Expect - 6e-09 
Identities = 38/39 (97%) 
Strand = Plus / Plus 



Query: 9 actatagggcacgcgtggtcgacggcccgggctgttatt 47 

IIIIMIIIIIIIIIIIIIllllllllllllll! 1 1 1 1 

Sbjct: 1 actatagggcacgcgtggtcgacggcccgggctggtatt 39 



>gi 1 12697590 | dbj |AB046716.l| Homo sapiens hST3Gal I gene for alpha 2,3- 
sialyltransf erase I, 

promoter region 

Length = 1950 



Score o 69.9 bits (35), Expect = 6e-09 
Identities = 38/39 (97%) 
Strand » Plus / Plus 



Query: 9 actatagggcacgcgtggtcgacggcccgggctgttatt 47 

i Minim ii mm minium mi i mi i mi 

Sbjct: 1 actatagggcacgcgtggtcgacggcccgggctggtatt 3 9 



>gi| 25453365 | gb|AY050651. 2 | Giardia intestinalis MYB (MYB) mRNA, complete 
cds 

Length « 3069 

■ 

Score « 67.9 bits (34), Expect = 2e-08 
Identities = 34/34 (100%) 
Strand = Plus / Minus 

Query: 9 actatagggcacgcgtggtcgacggcccgggctg 42 

I Mill I Ml MINI IMI I III Mil I II II I 

Sbjct: 3069 actatagggcacgcgtggtcgacggcccgggctg 3036 



Score =52.0 bits (26), Expect = 0.001 
Identities = 26/26 (100%) 
Strand = Plus / Plus 



Query: 17 gcacgcgtggtcgacggcccgggctg 42 

MM I III II M MM II Ml U 1 1 

Sb^ct: 1 gcacgcgtggtcgacggcccgggctg 26 
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>gi|l3183059|gb|AF237414.l| Anaplasma phagocytophilum RNA polymerase beta 
subunit (rpoB) gene, 

complete cds 
Length « 4185 

Score o 67.9 bits (34), Expect = 2e-08 
Identities = 34/34 (100%) 
Strand = Plus / Minus 

Query: 9 actatagggcacgcgtggtcgacggcccgggctg 42 

II III I MM II I II 1 1 III II 1 1 II I III Ml I 

Sb^ct: 4041 actatagggcacgcgtggtcgacggcccgggctg 4008 



>gi|l3625520|gb|AY014277.l| Lolium perenne gibberellin 20-oxidase gene, 
complete cds 

Length =2128 

Score = 67.9 bits (34) , Expect = 2e-08 
Identities - 34/34 (100%) 
Strand « Plus / Plus 

Query: 9 actatagggcacgcgtggtcgacggcccgggctg 42 

Ml IN I III 1 1 1 1 1 1 1 1 1 IN II MM I MM I 

Sbjct: 15 actatagggcacgcgtggtcgacggcccgggctg 4 8 



>gi|l7105179|gb|AF439558.l|AF439558 Mus musculus X2CR1 gene, promoter 
region and partial cds 
Length « 830 

Score = 67.9 bits (34), Expect = 2e~08 
Identities = 34/34 (100%) 
Strand = Plus / Plus 

Query: 9 actatagggcacgcgtggtcgacggcccgggctg 42 

• IIIIIIIIIIIIIIIIIIIMMIIIMIIIIII 

Sb;jct: 1 actatagggcacgcgtggtcgacggcccgggctg 34 



>gi|l6209547|gb|AY052528 .l| Glycine max calmodulin isoform-4 (cam-4) gene, 
promoter region 

and partial cds 

Length = 2050 

Score = 67.9 bits (34), Expect = 2e-08 
Identities = 34/34 (100%) 
Strand « Plus / Plus 



Query: 5 gattactatagggcacgcgtggtcgacggcccgg 3 8 

MIMIMMIMMIIIMMMMIIIIIMI 

Sbjct: 13 gattactatagggcacgcgtggtcgacggcccgg 46 
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>gx|l5213480|gb|AF241535.l|AP241535 Homo sapiens mucin 4 (MUC4) gene 
promoter sequence and partial 



cds 

Length = 3716 

Score m 67.9 bits (34), Expect » 2e-08 
Identities « 34/34 (100%) 
Strand = Plus / Plus 



Query: 9 actatagggcacgcgtggtcgacggcccgggctg 42 

sbiet , IIMIIIIIIIIIIMIIIIIIIIIIIIIIIIII 

srrjct: l actatagggcacgcgtggtcgacggcccgggctg 34 



>gi|l5558849|emb|AJ310936.l|FSO310936 Fusarium solani chy gene for putative 
cyanide hydratase enzyme putative 

Length = 1981 

Score = 67.9 bits (34), Expect = 2e-08 
Identities = 34/34 (100%) 
Strand « Plus / Minus 

Query: 9 actatagggcacgcgtggtcgacggcccgggctg 42 
sbHc, IIIIIIMIIIIIIIIIIIMIIIMIIIlTlll 
Sbjct: 1981 actatagggcacgcgtggtcgacggcccgggctg 1948 



>gi|4878023|gb|AF131239.2|AF131239 Rattus norvegicus alpha 1,2- 

fucosyl transferase C (FTC) gene, complete 
cds 

Length « 1555 

Score « 67.9 bits (34), Expect = 2e-08 
. Identities = 34/34 (100%) 
Strand =. Plus / Minus 

Query: 9 actatagggcacgcgtggtcgacggcccgggctg 42 

Sbict 1534 1 1 1 1 1 1 ■ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I TTT I I T 

strjct: 1534 actatagggcacgcgtggtcgacggcccgggctg 1501 



>gi 1 15216031 |emb|AJ318812.l|VFA318812 vicia faba var. minor aapl gene 
promoter region * y ' 

Length = 17 02 

Score = 67.9 bits (34), Expect « 2e-08 
Identities = 34/34 (100%) 
Strand = Plus / plus 

Query: 9 actatagggcacgcgtggtcgacggcccgggctg 42 

sbict i "lillllllMIIIIIMIMIIIIIIiillll 

snjct: l actatagggcacgcgtggtcgacggcccgggctg 34 
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> 3 i|4454294|emb|AJ132779.l|MMUl32779 Mus musculus pi 0 7 gene promoter region 
Length. = 776 ~ " ^ 

Score =67.9 bits (34), Expect « 2e-08 
Identities = 34/34 (100%) 
Strand = Plus / Plus . 

Query: 9 actatagggcacgcgtggtcgacggcccgggctg 42 

sh . , , iiiiiiiiniiiiiiiiiiiiiiiiiiiiiiii 

Sbjct: 1 actatagggcacgcgtggtcgacggcccgggctg 34 



>gi {3133088 | emb | Y16736 . 1 1 HSA16736 Homo sapiens dif-2 gene, promoter region 



Length « 1368 

Score = 67.9 bits (34), Expect = 2e~08 
Identities m 34/34 (100%) 
Strand = Plus / Plus 



Query: 9 actatagggcacgcgtggtcgacggcccgggctg 42 

qh . , „ iiiMiiiiiiiiiiimiiiiiiiiiiiiiif 

Sbjct: 11 actatagggcacgcgtggtcgacggcccgggctg 44 



>gi|669052l|gb|AP154245.l|AF154245 Rattus norvegicus chemotactic protein-3 
gene, complete cds 

Length = 2416 

Score = 67.9 bits (34), Expect = 2e-08 
Identities = 34/34 (100%) 
Strand = Plus / Plus 

Query: 9 actatagggcacgcgtggtcgacggcccgggctg 42 

Bh . , iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Sbjct: 1 actatagggcacgcgtggtcgacggcccgggctg 34 



>gi|5731977igb|AF114032.l|AP114032 Mus musculus glycogenin-1 gene, 

promoter and partial cds 
Length = 1958 

Score « 67.9 bits (34), Expect = 2e-08 
Identities = 34/34 (100%) 
Strand = Plus / Plus 

Query: 9 actatagggcacgcgtggtcgacggcccgggctg 42 

sbict , iiiriiiiiiiiiiiiiiiiiiiiiiiiiiiin 

sirjct: 1 actatagggcacgcgtggtcgacggcccgggctg 34 



>gi|26453412|dbj|A3094665.l| Seriola quinqueradiata YGHL1 gene for putative 
growth hormone 

like protein- 1, complete cds 

Length = 6658 
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Score = 67.9 bits (34), Expect = 2e-08 
Identities = 34/34 (100%) 
Strand = Plus / Plus 



Query: 9 actatagggcacgcgtggtcgacggcccgggctg 42 

MMIIIHIIMIIIIIIIIIIIIIIlTTllll 

Sbjct: 39 actatagggcacgcgtggtcgacggcccgggctg 72 



Itoltlll^llt l^ 099083 -MAF099083 Homo sapiens growth hormone secretagogue 

flanking region and partial cds 
Length =1237 

Score =67.9 bits (34), Expect = 2e-08 
Identities = 34/34 (100%) 
Strand = Plus / Plus 

Query: 9 actatagggcacgcgtggtcgacggcccgggctg 42 

sbict. i lUIil 1 < 1 1 n 1 1 1 1 1 n i m 1 1 1 1 1 iTTTi i? 

Sbuct: 1 actatagggcacgcgtggtcgacggcccgggctg 34 



geni? 7 exon 3 i 9b|AP ° 29214 - l|MM0X2S1 MUS mUSCUlus m OX " 2 ^igen homolog 
Length = 2791 

Score =67.9 bits (34), Expect = 2e-08 
Identities = 34/34 (100%) 
Strand = Plus / Minus 

Query: 9 actatagggcacgcgtggtcgacggcccgggctg 42 

sbict i i i i i i i i i i i i i i i i i i i i i i i i i i i i TTT i i T 

strict: 2791 actatagggcacgcgtggtcgacggcccgggctg 2758 



Score = 67.9 bits (34), Expect = 2e-08 
Identities = 34/34 (100%) 
Strand = Plus / Plus 



Query: 9 actatagggcacgcgtggtcgacggcccgggctg 42 

sbH.t n IIIIIIIIIIIIIMIIliliiillllllMIll 

strjct: l actatagggcacgcgtggtcgacggcccgggctg 34 

^«""iSi^r(^, l * PMS ' 16 «— «— «ave,a=i.„ B ^ glutamlne 
gene, complete cds 
Length = 2685 

Score =67.9 bits (34), Expect = 2e-08 
Identities = 34/34 (100%) 
Strand = Plus / Minus 
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Query: 9 actatagggcacgcgtggtcgacggcccgggctg 42 

ok- in ii 1 1 ill imri-MPimn] 

Sb 3 ct: 2685 actatagggcaogcgUgiigaiggiii^ij 2652 



Score =58.0 bits (29), Expect = 2e-05 
Identities = 29/29 (100%) 
Strand = Plus / Plus 

Query: 14 agggcacgcgtggtcgacggcccgggctg 42 

sb1ct x i i i i i i i i i i i i i i i i i i Tf i i i TTT i TT 

Sbjct: 1 agggcacgcgtggtcgacggcccgggctg 29 

Sf^f 83,9b|U53907 ' l|RNU539 ° 7 Rattus -rvegicus microsatellite sequence 
Length =498 

Score = 67.9 bits (34) , Expect = 2e-08 
Identities = 34/34 (100%) 
Strand = Plus / Plus 

Query: 9 actatagggcacgcgtggtcgacggcccgggctg 42 

SMefc * ININHIIIIIIIIIII III III || III || | | 

Sbuct: 5 actatagggcacgcgtggtcgacggcccgggctg 38 

>gi | 6683556 idbj |AB024534.l| Rattus norvegicus gene for thiazide-sensitive 

cotransporter, 5« flanking region 
Length =2145 

Score =67.9 bits (34), Expect = 2e-08 
Identities = 34/34 (100%) 
Strand = Plus / Plus 

ZZ I IE" f " n iff!??? - 

Sb]ct: 1 actatagggcacgcgtggtcgacggcccgggctg 34 

ffiSSIg'ISKS-SL. Rittus norve9i=ue 

complete cds 
Length =6359 

Score = 65.9 bits (33), Expect = le-07 
Identities = 33/33 (100%) 
Strand = pi us / Minus 



n 20/21 
Query: 7 ttactatagggcacgcgtggtcgacggcccggg 39 

Sb 3ct: 6349 ttac^aVaggg^gcgtgg^cgaiggiiiggg 6317 



^"JSS'SiS^r HOm ° SaPiSnS Partlal1 ^ ^P^-ted CHRNA7 
and partial exon 5 
Length = 1280 

Score = 65.9 bits (33), Expect = le-07 
Identities = 36/37 (97%) 
Strand = Plus / Plus 

Query: H tatagggcacgcgtggtcgacggcccgggctgttatt 47 

sbict. fi '111111111111111111111111 llll 

Sb 3 ct: 8 tatagggcacgcgtggtcgacggccpgggctggtaii 44 
>gi 1 5002511 1 eitlb 1 298266.1 1 HSZ98?fifi tj«~~ 

(exons 1-13) ' 56 - 1 l H SZ98266 Homo sapiens gene encoding plakophilin 

Length = 49999 

Score = 65.9 bits (33), Expect = le-07 
Identities = 33/33 (100%) 
Strand = Plus / Plus 

Query; lo ctatagggcacgcgtggtcgacggcccggcjcta 42 

sbict- 16 U'i ,ll,l,,ll,lll ' , ''IIIIIIIIIMI 

Sb 3 ct: 16 ctatagggcacgcgtggtcgacggcccgggctg 48 

Length = 1605 

Score = 65.9 bits (33), Expect = le-07 
Identities = 37/39 (94%) 
Strand = Plus / Plus 

Query: 9 a ^atagggcacgcgtggtcgacggcccgggctgttatt 47 

bm*. -, IIIIIIHIIIIIMIIII IN Mill | | | | 

Sb 3 ct: 1 actatagggcacgcgtggtcgacggccckggciggtati 39 

>gi| 2 4475537|db,, f 084484.1, Betula platyphylla DNA, .icrosatellite :Bp A 

Score « 65.9 bits (33), Expect = le-07 
Identities = 33/33 (100%) 
Strand = Plus / Minus 
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Query: 10 ctatagggcacgcgtggtcgacgg CCC g qq cta 42 

S bi Cfc ,„ iiiiiiiiiiiiiiiiiiiii i 1T1TTTTTTTT 

Sb D ct: 317 ctatagggcacgcgtggtcgacg^iiig^ig 28s 

(FREP12.1) gene, partial cds 
Length = 609 

Score =63.9 bits (32), Expect = 4e-07 
Identities = 32/32 (100%) 
Strand = Plus / Plus 



Query: li tatagggcacgcgtggtcgacggcccgggctg 42 

'I III II III 
:ccgggctg 32 



sbnct r ■■■■■■ i ■ i i i iTTi iTi iTTTTTTTT 7 i 

Sbjct: l tatagggcacgcgtggtcgacggcc--—— 



region 

Length = 1157 

Score =63.9 bits (32), Expect = 4e-07 
Identities . 35/36 (97%) 
Strand = Plus / plus 

Query: 7 ■-fctabtatagggcacgcgtggtogacggeccgggctg 42 

Length = 1023 

Score =63.9 bits (32), Expect = 4e-07 
Identities = 33/34 (97%) 
Strand = Plus / Minus 



Query: 9 ^ftagggcacgcgtggtcgacggcccgggc 
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